136 research outputs found

    Unsupervised Induction of Modern Standard Arabic Verb Classes

    Get PDF
    We exploit the resources in the Arabic Treebank (ATB) for the novel task of automatically creating lexical semantic verb classes for Modern Standard Arabic (MSA). Verbs are clustered into groups that share semantic elements of meaning as they exhibit similar syntactic behavior. The results of the clustering experiments are compared with a gold standard set of classes, which is approximated by using the noisy English translations provided in the ATB to create Levin-like classes for MSA. The quality of the clusters is found to be sensitive to the inclusion of information about lexical heads of the constituents in the syntactic frames, as well as parameters of the clustering algorithm. The best set of parameters yields an Fβ=1 score of 0.501, compared to a random baseline with an Fβ=1 score of 0.37

    Locality and Accessibility in Wh-Questions

    Get PDF
    Even in relatively configurational languages, such as English, speakers frequently have a choice between different constituent orders. Many of these word order variations have been linked to complexity (Hawkins 2005; inter alia). For example, heavy-NP shift is more likely if the shifted NP is more complex than the NP it shifts over (Wasow 1997). Other cases of word order variations, however, have not been considered in these terms. The choice between different wh-phrase orders, as in (1), has been said to be determined by (categorical) grammatical constraints, such as Superiorit

    ACI-BENCH: a Novel Ambient Clinical Intelligence Dataset for Benchmarking Automatic Visit Note Generation

    Full text link
    Recent immense breakthroughs in generative models such as in GPT4 have precipitated re-imagined ubiquitous usage of these models in all applications. One area that can benefit by improvements in artificial intelligence (AI) is healthcare. The note generation task from doctor-patient encounters, and its associated electronic medical record documentation, is one of the most arduous time-consuming tasks for physicians. It is also a natural prime potential beneficiary to advances in generative models. However with such advances, benchmarking is more critical than ever. Whether studying model weaknesses or developing new evaluation metrics, shared open datasets are an imperative part of understanding the current state-of-the-art. Unfortunately as clinic encounter conversations are not routinely recorded and are difficult to ethically share due to patient confidentiality, there are no sufficiently large clinic dialogue-note datasets to benchmark this task. Here we present the Ambient Clinical Intelligence Benchmark (ACI-BENCH) corpus, the largest dataset to date tackling the problem of AI-assisted note generation from visit dialogue. We also present the benchmark performances of several common state-of-the-art approaches

    The source ambiguity problem: Distinguishing the effects of grammar and processing on acceptability judgments

    Get PDF
    Judgments of linguistic unacceptability may theoretically arise from either grammatical deviance or significant processing difficulty. Acceptability data are thus naturally ambiguous in theories that explicitly distinguish formal and functional constraints. Here, we consider this source ambiguity problem in the context of Superiority effects: the dispreference for ordering a wh-phrase in front of a syntactically “superior” wh-phrase in multiple wh-questions, e.g., What did who buy? More specifically, we consider the acceptability contrast between such examples and so-called D-linked examples, e.g., Which toys did which parents buy? Evidence from acceptability and self-paced reading experiments demonstrates that (i) judgments and processing times for Superiority violations vary in parallel, as determined by the kind of wh-phrases they contain, (ii) judgments increase with exposure, while processing times decrease, (iii) reading times are highly predictive of acceptability judgments for the same items, and (iv) the effects of the complexity of the wh-phrases combine in both acceptability judgments and reading times. This evidence supports the conclusion that D-linking effects are likely reducible to independently motivated cognitive mechanisms whose effects emerge in a wide range of sentence contexts. This in turn suggests that Superiority effects, in general, may owe their character to differential processing difficulty

    Status Report of the DPHEP Study Group: Towards a Global Effort for Sustainable Data Preservation in High Energy Physics

    Full text link
    Data from high-energy physics (HEP) experiments are collected with significant financial and human effort and are mostly unique. An inter-experimental study group on HEP data preservation and long-term analysis was convened as a panel of the International Committee for Future Accelerators (ICFA). The group was formed by large collider-based experiments and investigated the technical and organisational aspects of HEP data preservation. An intermediate report was released in November 2009 addressing the general issues of data preservation in HEP. This paper includes and extends the intermediate report. It provides an analysis of the research case for data preservation and a detailed description of the various projects at experiment, laboratory and international levels. In addition, the paper provides a concrete proposal for an international organisation in charge of the data management and policies in high-energy physics

    Supporting Emirati females leadership skills through teaching them how to debate: Design, assessment, and considerations

    Get PDF
    © 2016 Elsevier Ltd. In response to the emerging need in the United Arab Emirates to empower young women and prepare them for future leadership tasks, a debate teaching intervention was organized in two phases at a public University in Dubai. During that intervention, 137 female Emirati students were taught the basics of debate and then participated in a debate session on a topic of general interest (Dubai EXPO 2020). Results show that participants observe a clear change in how they perceive themselves as leaders as a result of the intervention. Moreover, their leadership discourse as measured in terms of the persuasiveness of their expressed arguments at a group level was seen to improve more when the debate format followed had a formal structure than when it was flexible. Implications are discussed regarding the transformative learning function of debate as a training tool and its effect on leadership self-efficacy

    Critical Exponents, Hyperscaling and Universal Amplitude Ratios for Two- and Three-Dimensional Self-Avoiding Walks

    Get PDF
    We make a high-precision Monte Carlo study of two- and three-dimensional self-avoiding walks (SAWs) of length up to 80000 steps, using the pivot algorithm and the Karp-Luby algorithm. We study the critical exponents ν\nu and 2Δ4γ2\Delta_4 -\gamma as well as several universal amplitude ratios; in particular, we make an extremely sensitive test of the hyperscaling relation dν=2Δ4γd\nu = 2\Delta_4 -\gamma. In two dimensions, we confirm the predicted exponent ν=3/4\nu = 3/4 and the hyperscaling relation; we estimate the universal ratios  / =0.14026±0.00007\ / \ = 0.14026 \pm 0.00007,  / =0.43961±0.00034\ / \ = 0.43961 \pm 0.00034 and Ψ=0.66296±0.00043\Psi^* = 0.66296 \pm 0.00043 (68\% confidence limits). In three dimensions, we estimate ν=0.5877±0.0006\nu = 0.5877 \pm 0.0006 with a correction-to-scaling exponent Δ1=0.56±0.03\Delta_1 = 0.56 \pm 0.03 (subjective 68\% confidence limits). This value for ν\nu agrees excellently with the field-theoretic renormalization-group prediction, but there is some discrepancy for Δ1\Delta_1. Earlier Monte Carlo estimates of ν\nu, which were  ⁣0.592\approx\! 0.592, are now seen to be biased by corrections to scaling. We estimate the universal ratios  / =0.1599±0.0002\ / \ = 0.1599 \pm 0.0002 and Ψ=0.2471±0.0003\Psi^* = 0.2471 \pm 0.0003; since Ψ>0\Psi^* > 0, hyperscaling holds. The approach to Ψ\Psi^* is from above, contrary to the prediction of the two-parameter renormalization-group theory. We critically reexamine this theory, and explain where the error lies.Comment: 87 pages including 12 figures, 1029558 bytes Postscript (NYU-TH-94/09/01

    Search for Kaluza-Klein Graviton Emission in ppˉp\bar{p} Collisions at s=1.8\sqrt{s}=1.8 TeV using the Missing Energy Signature

    Get PDF
    We report on a search for direct Kaluza-Klein graviton production in a data sample of 84 pb1{pb}^{-1} of \ppb collisions at s\sqrt{s} = 1.8 TeV, recorded by the Collider Detector at Fermilab. We investigate the final state of large missing transverse energy and one or two high energy jets. We compare the data with the predictions from a 3+1+n3+1+n-dimensional Kaluza-Klein scenario in which gravity becomes strong at the TeV scale. At 95% confidence level (C.L.) for nn=2, 4, and 6 we exclude an effective Planck scale below 1.0, 0.77, and 0.71 TeV, respectively.Comment: Submitted to PRL, 7 pages 4 figures/Revision includes 5 figure

    Measurement of the average time-integrated mixing probability of b-flavored hadrons produced at the Tevatron

    Get PDF
    We have measured the number of like-sign (LS) and opposite-sign (OS) lepton pairs arising from double semileptonic decays of bb and bˉ\bar{b}-hadrons, pair-produced at the Fermilab Tevatron collider. The data samples were collected with the Collider Detector at Fermilab (CDF) during the 1992-1995 collider run by triggering on the existence of μμ\mu \mu and eμe \mu candidates in an event. The observed ratio of LS to OS dileptons leads to a measurement of the average time-integrated mixing probability of all produced bb-flavored hadrons which decay weakly, χˉ=0.152±0.007\bar{\chi} = 0.152 \pm 0.007 (stat.) ±0.011\pm 0.011 (syst.), that is significantly larger than the world average χˉ=0.118±0.005\bar{\chi} = 0.118 \pm 0.005.Comment: 47 pages, 10 figures, 15 tables Submitted to Phys. Rev.
    corecore